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Abstract 

We study the problem of estimating a mean pattern from a set of similar curves in the 
setting where the variability in the data is due to random geometric deformations and additive 
noise. This problem requires to define non-Euclidean distances by using the action of a Lie 
group on an infinite dimensional space of curves. This approach leads to the construction 
of estimators based on the notion of Frechet mean that is a generalization of the standard 
notion of averaging to non-Euclidean spaces. A recent research direction in nonparametric 
statistics is the study of the properties of the Frechet mean in deformable models, and the 
development of consistent estimators of a mean pattern. Using such models, we show the 
links that exist between minimax theory in nonparametric statistics and the problem of 
estimating a mean pattern from a sequence of curves. 

Keywords: Frechet mean, Non-Euclidean metric, Deformable models. Lie group action. Curve 
registration, Minimax rate of convergence, Sobolev balls. Functional data analysis. 
AMS classifications: Primary 62G08; secondary 62G20. 

Acknowledgements 

The authors acknowledge the support of the French Agence Nationale de la Recherche (ANR) 
under reference ANR-JCJC-SIMIl DEMOS. 

1 Introduction 
1.1 Frechet means 

The Frechet mean [Fre48] is an extension of the usual Euclidean mean to non-linear spaces 
endowed with non-Euclidean metrics. If Yi, . . . , Yj denote i.i.d. random variables with values 
in a metric space A4 with metric d_A4, then the empirical Frechet mean Y^ of the sample 



Yi , . . . , Y J is defined as a minimizer (not necessarily unique) of 

- 1 ^ 

Ym e argmin - S^ d)^{y ,Y j) . 

For random variables belonging to a nonlinear manifold, a well-known example is the computation 
of the mean of a set of planar shapes in the Kendall's shape space [Ken84] that leads to the 
Procrustean means studied in [Goo91]. A detailed study of some properties of the Frechet 
mean in finite dimensional Riemannian manifolds (such as consistency and uniqueness) has been 
performed in [Afsll, BP03, BP05, Hucll]. However, there is not so much work on the properties 
of the Frechet mean in infinite dimensional and non-Euclidean spaces of curves or images. In 
this paper, we are concerned by the nonparametric estimation of a mean pattern (belonging to 
a non-linear space) from a set of similar curves in the setting where the variability in the data is 
due to random geometric deformations and additive noise. 

More precisely, let us consider noisy realizations of J curves /i, . . . , /j : [0, 1] — t- M sampled 
at n equispaced points = ^, £ = 1, . . . , n, 

Ytj = fj{te) + e£,j, £ = l,...,n, and j = 1, . . . , J, (1.1) 

where the e^j's are independent and identically distributed (i.i.d) Gaussian variables with zero 
expectation and variance o"^ > 0. In many applications, the observed curves have a similar 
structure that may lead to the assumption that the /j's are random elements varying around 
the same mean pattern / : [0, 1] — )• M (also called reference template). However, due to additive 
noise and geometric variability in the data, this mean pattern is typically unknown and has 
to be estimated. In this setting, a widely used approach is Grenander's pattern theory [Gre93, 
GM07, TY05, TYll] that models geometric variability by the action of a Lie group on an infinite 
dimensional space of curves (or images). 

When the curves /-,• in (1.1) exhibit a large source of geometric variations in time, this may 
significantly complicates the construction of a consistent estimator of a mean pattern. In what 
follows, we consider the simple model of randomly shifted curves that is commonly used in many 
applied areas such as neurosciences [TIRll] or biology [R0nOl]. In such a framework, we have 

fj{t) = fit - e*), for aU t G [0, 1] and j = 1, . . . , J, (1.2) 

where / : [0, 1] — t- M is an unknown curve that can be extended outside [0, 1] by 1-periodicity. 
The shifts O^s are supposed to be i.i.d. real random variables (independent of the e^j's) that are 
sampled from an unknown distribution P^. on M. In model (1.2), the shifts 6j represent a source 
of geometric variability in time. 

1.2 Model and objectives 

The main goal of this paper is to construct nonparametric estimators of the mean pattern / from 
the data 

Ye,j = f{te - Oj) + eej, i=l,...,n, and j = 1, . . . , J, (1.3) 
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in the setting where both the number J of curves and the number n of design points are allowed 
to vary and to tend towards infinity. 

In the sequel of the paper, it will be assumed that the random shifts are sampled from an 
unknown density g with respect to the Lebesgue measure d9 (namely d¥^:{6) = g(9)d9). Note 
that since / is assumed to be 1-periodic, one may restrict to the case where the density g has a 
compact support included in the interval [— ^, ^] . Under the assumption (1.2), the (standard) 
Euclidean mean Yi = j X]/=i °f data is generally not a consistent estimator of the mean 
pattern f at t = ti. Indeed, the law of large numbers implies that 




a.s. . 



Thus, under mild assumptions on /, one has limj^oo 7^ /(^^) ^■•S- ■ 

To build a consistent estimator of / in model (1.3), we propose to use a notion of empirical 
Frechet mean in an infinite dimensional space. Recently, some properties of Frechet means in 
randomly shifted curves models have been investigated in [BGIO] and [BCll]. However, studying 
the rate of convergence and the minimax properties of such estimators in the double asymptotic 
setting min(n, J) — t- +00 has not been considered so far. 

Note that model (1.3) is clearly not identifiable as for any 9 £ [~^) ^] > one may replace /(•) 
by /(■) = /(■ — ^) and by 9j =9*^—9 without changing model (1.3). Therefore, estimation of 
/ is only feasible up to a time shift. Thus, we propose to consider the problem of estimating its 
equivalence class [/] (or orbit) under the action of shifts. More precisely, let Lpg^([0, 1]) be the 
space of squared integrable functions on [0, 1] that can be extended outside [0, 1] by 1-periodicity. 
Let §1 be the one-dimensional torus. We recall that any element r = t{9) E Si can be identified 
with an element 9 in the interval [— ^, | [■ For / G Lpgj,([0, 1]), we define its equivalence class by 
the action of a time shift as 

[/] := {r, TGSi}, 

where for r = t{9) G Si (with 9 G [-\,\[), Fit) = f{t - 9) for all t G [0,1]. Let /, /i G 
Lpg^([0, 1]), we define the distance between [/], [h] G Lpg^([0, 1])/Si as 

d([/], [h]) = inf ( |/(t -9)- h{t)\''dt] ' . (1.4) 

In the setting of Grenander's pattern theory, (Lpg^([0, l])/§i, d) represents an infinite dimensional 
and non- linear set of curves, and §1 is a Lie group modeling geometric variability in the data. 

1.3 Main contributions 

Let us assume that F C Lpgj,([0, 1]) represents some smoothness class of functions (e.g a Sobolev 
ball). Suppose also that the unknown density g of the random shifts in (1.2) belongs to some 
set Q of probability density functions on [— ^, ^] • Let be some estimator of / based on the 
random variables Y^j- given by (1.3) taking its values in Lpg^([0, 1]). For some f £ the risk of 
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the estimator fn^j is defined by 

7^,(/„,J,/)=E5 (rf'([/n,j], [/])), 

where the above expectation is taken with respect to the distribution of the Yij^s in (1.3) and 
under the assumption that the shifts are i.i.d. random variables sampled from the density g. We 
propose to investigate the optimality of an estimator by introducing the following minimax risk 

T^n,j{^,Q) = inf supsup7^g(/„,,J,/), 

fn,J 9&Q /GJ- 

where the above infimum is taken over the set of all possible estimators in model (1.3). 

For / G Lpg^([0, 1]), let us denote its Fourier coefficients by Ck = f{t)e~^'^'^''^dt, k e Z. 
Suppose that is a Sobolev ball of non-constant functions with degree of smoothness s > 1/2, 
namely 

T = Ws{A,c.) = J / G Ll,,{[0, 1]); ^ (1 + \k\^') |cfc|2 < with |ci| > i 
I kez ) 

for some positive reals A and c*. 

Moreover, let Q'^ be a set of probability densities having a compact support of size smaller 
than n with < k < 1/8 defined as 

/■1/2 1 
^ IR+ ; / g{9)d9 = 1, and supp(5) = [-k/2, k/2] ) . 

J~l/2 J 

Suppose also that the following condition holds 

J >i n"', for some < q < 1/6, 

where the notation J x means that there exists two positive constants C2 > ci > such that 
ci?i" < J < C2n." for any choices of J and n. 

Then, under such assumptions, the main contribution of the paper is to show that one can 
construct an estimator fn^j based on a smoothed Frechet mean of discretely sampled curves that 
satisfies 

limSUp J sup sup Tlg{fn,J, f) < Co 

min(n,J)^+oo ' g&G'^ f£W4A,c,) 

where Cq > is a constant that only depends on A, s, k, c^, and o"^. The rate of convergence 
rn,j is given by 

2s 

The two terms in the rate r„^j have different interpretations. The second term (nJ) 2s+i is the 
usual nonparametric rate for estimating the function / (over a Sobolev ball) in model (1.3) that 
we would obtain if the true shifts 6^, . . . ,9j were known. 

The first term in the rate r„^j can be interpreted as follows. As shown later in the 
paper, the computation of a Frechet mean of curves is a two step procedure. It consists in 



1 1 
2' 2 
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building estimators 6j of the unknown shifts and, then, in ahgning the observed curves. For 

e = {01,..., 0j) £ R-\ let us define the Euclidean norm \\9\\ = (Ej=i l^jP) ■ One of the 
contributions of this paper is to show that an estimation of the vector 

1 

0° = {61 -Oj, ...,e*j- ej)' e RJ, where 0J = -.^9* , 

is feasible at the rate for the normalized quadratic risk jE||^'' — O^W"^ and that this allows 
to build a consistent Frechet mean. If the number J of curves was fixed, would correspond 
to the usual semi-parametric rate for estimating the shifts in model (1.3) in the setting where 
the O'- are non-random parameters, see [BLVIO, GLM07, VimlO] for further details. Here, this 
rate of convergence has been obtained in the double asymptotic setting min(n, J) — t- -|-oo. This 
setting significantly complicates the estimation of the vector 6^ G since its dimension J is 
increasing with the sample size nJ. Hence, in the case of min(n, J) — t- -|-oo, estimating the shifts 
at the rate is not a standard semi-parametric problem and we had to impose the constraint 
J n°' (with < a < 1/6) to obtain this result. The term in the rate r„^j is thus the 
price to pay for not knowing the random shifts in (1.3) that need to be estimated to compute a 
Frechet mean. 



1.4 Organization of the paper 

In Section 2, we define a notion of smoothed Frechet means of curves. We also discuss the 
connection between this approach and the well known problems of curve registration and image 
warping. In Section 3, we discuss the rate of converge of the estimators of the shifts. We also 
build a Frechet mean using model selection techniques and we derive an upper bound on its rate 
of convergence. In Section 4, we derive a lower bound on the minimax risk and we give some 
sufficient conditions to obtain a smoothed Frechet mean converging at an optimal rate in the 
minimax sense. In Section 5, we discuss the main results of the paper and their connections with 
the nonparametric literature on deformable models. Some numerical experiments on simulated 
data are presented in Section 6. The proof of the main results are gathered in a technical 
Appendix. 



2 Smoothed Frechet means of curves 

Let /i, . . . , /j be a set of functions in Lpg^([0, 1]). We define the Frechet mean [/] of [fi], . . . , [fj] 
as 

J 



argmin l^Y.d\[f],[f,]). 



[/]GL2„([0,l])/§i J 

It can be easily checked that a representant / G Lp^^{[0, 1]) of the class [/] is given by the 
following two steps procedure: 
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1. Computation of shifts to align the curves 
{01, ...,ej)e argmin 7 X] / 



1 



j'=i 



dt. 



(2.1) 



2. Averaging after an ahgnment step : f{t) = j X]/=i fji^ + ^j)' ^ ^ [0' 

Let us now explain how the above two steps procedure can be used to define an estimator of 
/ in model (1.3). Let 

{«*)=e-',t.[0,ll}^^^ 

be the standard Fourier basis. For legibility, we assume that n = 2N > 4 is even and we split 
the data into two samples as follows, 



for J = 1, ... , J, and 



Y^^ = Y2,,j and Y^^^ = ¥2,-1, ,q = l,...,N, 



= t and = t2g-i ,q = l,...,N. 



-q "Iq <^ii>^ ''q — <^2q~ 

For any z G C, we denote by z its complex conjugate. Then, we define the following empirical 
Fourier coefficients 

N 

< —, j = 1,...,J, 



g(0) _ 

%j ~ 


1 

iV 


N 

9=1 






1 JO) 
s/N "'^ 




1 

iV 


N 






1 (1) 




q=l 


hit^q^) 





N 

y 



N 



where 



^"S = nI1 f(*? - (^j)Mt^i^), P G {0, 1}, 



9=1 



and the -z^^j's are i.i.d. complex Gaussian variables with zero expectation and variance a''. 
Then, we define estimators of the unknown random shifts 9* as 



{9i,...,ej)e argmin M„(^i, . . . , ^j) 



where 



1 ^ 

M„(0i,...,0j) = -^ ^ 



j=l \k\<ko 



1 



(2.2) 



(2.3) 



with some positive integer that will be discussed later. The smoothed Frechet mean of / is 
then defined as 



\k\<m \ j=l 



p(m) 



J 
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where the integer m € {1, . . . , N/2} is a frequency cut-off parameter that will be discussed later 
and fj"^\t) = X]|fc|<m c[,^]0fc(i)- Note that the estimators 9j of the shifts have been computed 
using only half of the data and that the curves /j™^ are calculated using the other half of the 

data. By splitting the data in such a way, the random variables 9j and are independent 

conditionally to {9^, . . . , 9j). 

Note also that this two step procedure does not require the use of a reference template 
to compute estimators 9i,...,9j of the random shifts. Indeed, one can interpret the term 
7 X^/'=i (2-1) ^-s a template that is automatically estimated. In statistics, estimating 

a mean pattern from set of curves that differ by a time transformation is usually referred to as 
the curve registration problem. It has received a lot of attention over the last two decades, see 
e.g. [Big06, R0nOl, TIRll] and references therein. Hence, there exists a connection between 
our approach and the well known problems of curve registration and its generalization to higher 
dimensions (image warping), see e.g. [GMOl]. However, studying the minimax properties of an 
estimator of a mean pattern in curve registration models has not been investigated so far. 



3 Upper bound on the risk 

3.1 Consistent estimation of the unknown shifts 

Note that, due to identifiability issues in model (1.3), the minimization (2.2) is not well defined. 
Indeed, for any (^i, . . . , 9j) that minimizes (2.3), one has that for any 9 such that {9i +9, . . . ,9j + 
0) G [— ^, ^] , this vector is also a minimizer of M„. Choosing identifiability conditions amounts 
to impose constraints on the minimization of the criterion 



1 

Mi9u...,9j) = -jYl E 

j = l \k\<ko 



(3.1) 



where c^. A; G Z, are the Fourier coefficients of the mean pattern /. The criterion (3.1) can be 
interpreted as a version without noise of the criterion (2.3) when replacing (f^^- by c^e"*^'^'^^-' . 
Obviously, the criterion (3.1) admits a minimum at 9* = {9l,...,9*j) such that M{9*) = 0. 

However, this minimizer over [— ^5 ^] is clearly not unique. To impose uniqueness of some 
minimum of M over a restricted set, let us introduce the following identifiability conditions: 

Assumption 1. The distribution g of the random shifts has a compact support included in 
[-k/2,k/2] for some < k < 1/8. 

Assumption 2. The mean pattern f in model (1.3) is such that c\ = f{x)e~^'^'^^dx 7^ 0. 

Assumption 1 means that the support of the density g of the random shifts should be suffi- 
ciently small. This implies that the shifted curves f{t — 9j) are somehow concentrated around the 
unknown mean pattern /. Such an assumption of concentration of the data around a reference 
shape has been used in various papers to prove the uniqueness and the consistency of Frechet 
means for random variables lying in a Riemannian manifold, see [Afsll, BP03, BP05, Hucll]. 



7 



Assumption 2 is an identifiability condition to avoid the case where the function / is constant 
over [0, 1] which would make impossible the estimation of the unobserved random shifts. 
For < K < 1/8, let us define the constrained set 



Q^={ {9u...,9j) G [-k/2,k/2]^ yjj = 

Let 



1 

^° = ^1-7E^I" ^ = h---,J, andeO = (e?,...,0O). 

i'=i 

Thanks to Proposition 4.1 in [BCll], we have 

Proposition 3.1. Suppose that Assumptions 1 and 2 hold. Then, for any (9i, . . . , 9j) £ G^, 

1 ^ 

M(ei, ...,9j)- M{9l ...,9^j)> C{f, k)-Y^ - ^,T, 

i=i 

where C{f,K) = 47r^|cip cos(47rK) > 0. 

Therefore, over the constrained set 0^, the criterion (3.1) has a unique minimum at 9^ such 
that M{9^) = 0. Let us now consider the estimators 

9^ = {9l...,9^j)£ argmin M„(0i, . . . , 0j). (3.2) 

(6»i,...,ej)ee« 

The following Theorem shows that, under appropriate assumptions, the vector 9^ is a consistent 
estimator of 9^. 

Theorem 3.1. Suppose that Assumptions 1 and 2 hold. Let J > 2 and s > 2, then there exits 
a constant C > that only depends on A, s, k, c* and such that, for any f £ Ws{A,C:^), we 
have 

Inequality (3.3) shows that the quality of the estimation of the random shifts depends on 
the ratio between n and J. In particular, it suggests that the quality of this estimation should 
deteriorate if the number J of curves increases and n remains fixed. This shows that estimating 
the vector 9^ € M'^ is not a standard parametric problem, since the dimension J is is allowed to 
grow to infinity in our setting. To the contrary, if J is not too large with respect to n, then an 
estimation of the shifts is feasible at the usual parametric rate n~^. More precisely, by Theorem 
3.1, we immediately have the following result: 

Corollary 3.1. Suppose that the assumptions of Theorem 3.1 are satisfied. If ko > 1 is a fixed 
integer and J ^ for some < a < 1/6, then there exits Ci > that only depends on A, s, 
cr^, K and ko such that 

}_E3\\9'^-9Y < — ■ 
J n 

Therefore, under the additional assumption that J ^ n"', for some < a < 1/6, the vector 
9^ converges to 9^ at the rate for the normalized Euclidean norm. 
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3.2 Estimation of the mean pattern 

For p G {0, 1}, let y(P) = (fJ^^) 

1<9<A^, ^<j<J- Thanks to the estimator 0^ of the random shifts, 
we can ahgn the data Y^^^f in order to estimate the mean pattern / in (1.3). Let rrii < N/2 be 
some positive integer. For any m G {1, . . . , mi}, we recah that the estimator is given by 

i=i 

To simphfy the notations, we omit the dependency on k^, n and J of the above estimators 
and we write 

f{m) ^ j{rn)^ denote by E^^) the expectation according to the distribution of 

Y^^\ By construction, we recall that and F^^) are independent. Thus, note that is a 
biased estimator of / with respect to E^^^ since 

-^ J _ 



where we have set 

|fe|<m 

The idea of the procedure is that if the estimators 9'^ of the shifts behave well then [Z'-'"^^]) 
is small and estimating / amounts to estimate /("^i). 

To choose an estimator of /("^i) among the/(™)'s, we take a model selection approach. Before 
describing the procedure, let us compute the quadratic risk of an estimator f^'^\ 



to 




dt 






Jo 



■(m) , 



dt + 



(2171 + 1)0-2 

iVJ 



This risk is a sum of two non- negative terms. The first one is a bias term that is small when m 
is close to mi while the second one is a variance term that is small when m is close to zero. The 
aim is to find a trade-off between these two terms thanks to the data only. More precisely, we 
choose some m G {1, . . . , mi} such that 



m G argmm 

mg{l,...,m.i} 



dt + rj 



(2m + l)a'^ 
iVJ 



(3.4) 



where ry > 1 is some constant. In the sequel, the estimator that we finally consider is /„,^j = 

Such a procedure is well known and we refer to Chapter 4 of [Mas07] for more details. In 
particular, the estimator fn,j satisfies the following inequality 

-1 



]E(i) 

< Civ) 

< Civ) 







(mi) 



it) - fn,m 

1 



dt 



min E^^) 

mG{l,...,mi} 



mm 

r)i£{l,...,mi} 







f'^'Kt) - f^'^^t) 



(m) / 



dt + 



dt 

2(m + l)cr 



NJ 



(3.5) 
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where C{r]) > only depends on rj. It is known that an optimal choice for r/ is a difficult problem 
from a theoretical point of view. However, in practice, taking some r] slightly greater than 2 leads 
to a procedure that behaves well as we discuss in Section 6. 

3.3 Convergence rates over Sobolev balls 

Let us denote by [x\ the largest integer smaller than x G M. We now focus on the performances 
of our estimation procedure from the minimax point of view and with respect to the distance d 
defined in (1.4). Note that, in the subsection 3.2, we only use truncated Fourier series expansion 
for building the estimators In practice, we could use other bases of Lpg^([0, 1]) as we 

would still have a result like (3.5). In particular, the following theorem would remain true 
by combining model selection techniques with bases like piecewise polynomials or orthonormal 
wavelets to approximate a function. 

Theorem 3.2. Assume that nJ > max{21J, (4cj^)^*"'"-^/c^^} where < c < 1 is such that 
J < cn°' for some a > 0. Take mi = [N/2\ — 1 and let s > 3/2 and A > 0. Then, the estimator 
fn,J defined by the procedure (3.4) is such that 



for some C > that only depends on A, s, a^, k, /cq, i] and c. 

Therefore, using the results of Corollary 3.1 on the convergence rate of 6^ to 9^, we finally 
obtain the following result. 

Corollary 3.2. Suppose that the assumptions of Theorem 3.1 and Theorem 3.2 are satisfied. If 
kf) > 1 is a fixed integer and J ^ n°' for some < a < 1/6, then there exits C > that only 
depends on A, s, , k, ko, rj, and a such that 



4 A lower bound on the risk 

The following theorem gives a lower bound on the risk over the Sobolev ball Ws{A,c^). 
Theorem 4.1. Let us recall that 



sup sup Tlg{fn,jJ) < C {\mi\ '^'' + min 



-2s+l 




sup sup 




Tin, J {Ws{A, C^), g") = mi sup sup Tlg{fn,J,f)- 



There exists a constant C > that only depends on A, s , c* and such that 



min(n, J)— >+oo 



liminf (nJ)^7^„,J(W',(^,c,),g'') > C . 
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Therefore, under the assumption that 2as < 1, the smoothed Frechet mean converges at the 

2a 

optimal rate (n J) 2s+i as shown by the following corollary. 

Corollary 4.1. Suppose that the assumptions of Corollary 3.2 hold and assume that 2as < 1. 
Then, there exists a constant C" > that only depends on A, s, , k, /cq, r], c^, and c such that 

sup sup E3fd\[UJ,[f])) <C'inJ)-^K (4.1) 

Note that inequality (4.1) is a direct consequence of Corollary 3.2 and the fact that = 
0^(nJ)~2s+i^ in the settings 2as < 1 and J ^ n'^. 

5 Discussion 

1 23 

As explained previously, the rate of convergence r^^j = n~ + (nJ) 2s+i of the estimator fn,j 

is the sum of two terms having different interpretations. The term (nJ) ^s+i is the usual 
nonparametric rate that would be obtained if the random shifts 9\,...,9*j were known. To 
interpret the second term n~^, let us mention the following result that has been obtained in 
[BCll]. 

Proposition 5.1. Suppose that the function f is continuously differentiable. Assume that the 
density g e Q'^ with lime^_«/2 51(6*) = limg_^^ /2 g{9) = and that /^(^^ (m^°S5(^)) 9{0)d9 < 

+00. Let (01, ... , 9j) denote any estimator of the true shifts {9^, . . . , 9^) computed from the Ygj 's 
in model (1.3). Then, 




/; |/'(t)|2dt + ^2 J-V2 ^o_i^^g^e)y g^e)d9 



n- In L/ V'/l cnz-ru I 



Proposition (5.1) shows that it is not possible to build consistent estimators of the shifts by 
considering only the asymptotic setting where the number of curves J tends towards infinity. 
Indeed Inequality (5.1) implies that liminf j_5.+oo lE^ S/=i(^j ~ ^j)^) ^ ^ ^'^^ estimators 
{9i, . . . ,9j). We recall that, under the assumptions of Corollary 3.1, one has 




The above inequality shows that, in the setting where n and J are both allowed to increase, the 
estimation of the unknown shifts = ~ 7 X]m=i feasible at the rate n~^. By Proposition 
5.1, this rate of convergence cannot be improved. We thus interpret the term appearing in 
the rate r„^j of the smoothed Frechet mean [fn,j] as the price to pay for having to estimate the 
shifts to compute such estimators. 
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To conclude this discussion, we would like to mention the results that have been obtained 
in [BGIO] in an asymptotic setting where only the number J of curves is let going to infinity. 
Consider the following model of randomly shifted curves with additive white noise 

dYj{t) = fit - e*)dt + edWjit), i G [0, 1], j = 1, . . . , J with e* r^ud 9, (5.2) 

where the Wj^s are independent Brownian motions with e > being the level of additive noise. 
In model (5.2), the expectation of each observed curve dYj is equal to the convolution of / by 
the density g since 

[fit-Oj)] = J f{t-e)g{e)de = f^g{t). 

Therefore, in the ideal situation where g is assumed to be known, it has been shown in [BGIO] 
that estimating / in the asymptotic setting J — )■ +oo (with e > being fixed) is a deconvolution 
problem. Indeed, suppose that, for some v > 1/2, 

lk= [\{9)e'''^''d9>^\kr, keZ, 
Jo 

with g being known. Then, one can construct an estimator fj by a deconvolution procedure 
such that ^ 

sup / |/}(t)-/(t)|2rft<CJ-™, 
feWs{A) Jo 

for some C > that only depends on A, s and e and where Ws(^) is Sobolev ball of degree 
s > 1/2. Moreover, this rate of convergence is optimal since the results in [BGIO] show that if 
s > 2v + 1, then there exists a constant C" > that only depends on A, s and e such that 

liminf Jas+a-'+i inf sup Tl{fj,f)>C 

fj f€Ws(A) 

where the above infimum is taken over the set of all estimators fj of / in model (5.2). Hence, 

2s 

rj = J 23+2^+1 is the minimax rate of convergence over Sobolev balls in model (5.2) in the case 
of known g. This rate is of polynomial order of the number of curves J, and it deteriorates 
as the smoothness v of the convolution kernel g increases. This phenomenon is a well-known 
fact in deconvolution problems, see e.g [Fan91, PV99]. Hence, depending on g being known or 
not and the choice of the asymptotic setting, there exists a significant difference in the rates 
of convergence that can be achieved in a randomly shifted curves model. Our setting yields 
to the rate r„^j = n~ + (nJ) ^s+i (^[^ the case where J >i n'^ with a < 1/6) that is clearly 

2s 

faster than the rate rj = J 2s+2i^+i , Nevertheless, the arguments in [BGIO] also suggest that 
a smoothed Frechet mean in (5.2) is not a consistent estimator of / if one only lets Jgoing to 
infinity. Therefore, the number n of design points is clearly of primary importance to obtain 
consistent estimators of a mean pattern when using Frechet means of curves. 
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6 Numerical experiments 



The goal of this section is to study the performances of the estimator fn,j- The factors in the 
simulations are the number J of curves and the number n of design points. As a mean pattern / 
to recover, we consider the two test functions displayed in Figure 1. Then, for each combination 




0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



(a) 



(b) 



Figure 1: Two test functions /. (a) MixtGauss: a mixture of three Gaussians. (b) 
HeaviSine: a piecewise smooth curve with a discontinuity. 



of n and J, we generate M = 100 repetitions of model (1.3) of J curves with shifts sampled from 
the uniform distribution on [— k, k] with k = 1/16. The level of the additive Gaussian noise is 
measured as the root of the signal-to-noise ratio (rsnr) defined as 



rsnr 



1 



1 _ 
2 / (fit) - ffdt where / 



f{t)dt, 



that is fixed to rsnr = 0.5 in all the simulations. For each repetition p G {1,...,M}, we 
compute the estimator fn,j,p using a gradient descent algorithm to minimize the criterion (3.2) 
for estimating the shifts. For all values of n and J, we took A;o = 5 in (2.3). The frequency 
cut-off m is chosen using (3.4) with r] = 2.5. 

To analyze the numerical performances of this estimator, we have considered the following 
ideal estimator that uses the knowledge of the true random shifts 0* ^ (sampled from the p-th. 
replication) 

fT> = E fjEC,/^'^'^- 1 Mt), t E [0,1] . 

|fc|<m \ i=l 

The frequency cut-off for the above ideal estimator is chosen using a model selection procedure 
based on the knowledge of the true shifts i.e. 



m* G argmin 

mS{l,...,ini} 
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with r] = 2.5. 

Then, we define the relative empirical error between the two estimators as 

1 ^ 

Rin, J) = 

p=i 

In Figure 2, we display the ratio R{n^ J) for various values of n and J and for the two test 
functions displayed in Figure 1. It can be seen that the function J i— )• R{n,J) is increasing. 
This means that the numerical performances of the estimator fn,J deteriorate as the number 
J of curves increases and the number n remains fixed. This is clearly due to the fact that the 
estimation of the shifts becomes less precise when the dimension J increases. These numerical 
results are thus consistent with inequality (3.3) in Theorem 3.1 and our discussion on the rate of 
convergence of fn,j in Section 3. On th opposite, the function n i— )• R{n, J) is decreasing and it 
confirms that the number n of design points is a key parameter to obtain consistent estimators 
of a mean pattern / with Frechet means of curves. 




(a) (b) 

Figure 2: Relative empirical error R{n, J) for various values of n (vertical axis) and 
J (horizontal axis) over M = 100 replications: (a) MixtGauss, (b) HeaviSine. 



A Proof of the main results 

Throughout the proofs, we repeatedly use the following lemma which follows immediately from 
Lemma 1.10 in [Tsy09]. 

Lemma A.l. If f G VFs(A, c*) then there exists a constant Aq > only depending on A and s 
such that 

max ^ leg - cfce-*2.fce;| < ^ ^ 

- — <k< — 



14 



for all 1 < i < J. 



A.l Proof of Theorem 3.1 

For legibility, we will write E = E^, i.e. we omit the dependency on g of the expectation. The 
proof is divided in several lemmas. Let || • || denote the standard Euclidean norm in M"^. First, 
we derive upper bounds on the second, fourth and sixth moments of H^*^ — 0^\\. 

Lemma A. 2. Let N > 2, J > 1 and I < ko < N/2. We assume that Assumptions 1 and 2 
are satisfied and we suppose that s > 3/2. Then, we have the following upper bounds, for any 

feWs{A,c,), 



J 



1/2 



-/2 



-^(\\e^-eX) <C2— 

J \ J n 



and 



1 



-E 



30 /)0||6 



J~ V"" - u J - -o ^3/2 > 

where Ci, C2 and C3 are positive constants that only depend on A, s, c*, cr^ and n. 
Proof. Let / G Ws(A, c^,). Since 6^ = {61, . . . , ^j) is a minimizer of M„, it follows that 

M(^°) - M(6'°) < 2 sup \Mn{e) - M((9)|. 



Therefore, by Proposition 3.1, we get 



-E||^O-0Of 

J 



< 2C-^{c^,k)E ( sup \Mn{e) - M{e)\ 



^E||6'"" - ^"f < AC-'\c^,k)JK I sup \Mn{9) - M(0)|' 



and 



1 



E||0^ - e'^f < 8C7-^(c,, k) J^E sup \Mn{e) - M{e)\' 



where we have set C(c*, k) = 47r^c^ cos(87rAt). 

Let G and note that Mn{6) can be decomposed as 

Mn{e) = M{e) + Q{e) + L{e), 



(A.l) 
(A.2) 

(A.3) 



(A.4) 
(A.5) 

(A.6) 



(A.7) 



where 



j = l \k\<ko 



-(0) 2ikn9j 



(0) jiikTve^i 
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j = l \k\<ko 



1 



,e y 



i'=i 



and 



L(^) 



7 



Using Lemma A.l, it follows that, for any 9 G 0^, 
J 



me) -Mm < \y.T. 



J 



j=l \k\<ko 



(0) 



J 



-2ikTTe*,s2ikTTej, 



j'=l 



< 2^oA^-""^i/2 ^ (^4|cfc| +2AoA^-"^i/2 
|fc|<fco 



V |fc|<fco 

Hence, there exists a positive constant C that only depends on A and s such that 

sup \M{9) - M{e)\ < Ckl''^N~'^^/'^ . 



(A.8) 



Now, note that Q{e) < fjZ with Z = E\k\<k,EU 4°]/^ 
follows that 

E sup \Q{e)\ < a^{2ko + l)N-\ 



and 



E sup \Q{e)\^ < 2a^{2kQ + lfN~^. 



for any 9 £ G^. Thus, it 
(A.9) 

(A.IO) 



By Jensen's Inequality, we get EZ^/"^ < (EZ^)^/^ and, since EZ"^ < 2j'^{2ko + 1)^, we obtain 



Esup \Q{e)\y'<8'/'^{2ko + lf/'. 



(A.ll) 



Finally, using EZ^ < 6J'^(2ko + 1)^, we have 



Esup \Q{e)\^ < 6j^{2ko + ly . 



(A.12) 
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By Cauchy-Schwarz's Inequality 

L{e) < 2^ M {9)^/0(9) . (A.13) 

Thanks to Lemma A.l, we get 

^^W^7 E Eht]f < E \c,\' + AlN^^-^^\2ko + l). 

Thus, it follows from (A. 9), (A. 10), (A. 11) and (A.13) that there exists a positive constant C, 
only depending on A, s and cr^, such that 



E sup \L{e)\ < C'kl^^N-^/^ , (A.14) 

/2, 



E sup |L(0)|2 < c"koN^^ (A.15) 
and 

E sup \L{9)f < C'^kl^^N-^/^ . (A.16) 
Since s > 3/2, we obtain, by Inequalities (A. 8), (A. 9) and (A.14), 

E ( sup \Mn{e) - M{9)\] < C^/fc^ , 

by Inequalities (A.8), (A. 10) and (A.15), 



E ( sup \Mn{9) - M{9)\A < C2koN-^ , 

and by Inequalities (A.8), (A. 12) and (A.16), 

E ( sup \Mn{9) - M{9)A < C'^kl^^N-^/^ , 

where C[, and Cg are positive constants that only depend on A, s and cj^. Combined 
with Inequalities (A. 4), (A. 5) and (A. 6), the announced result follows from the above upper 
bounds. □ 

In order to prove Theorem 3.1, we divide the rest of the proof in the three following steps. 
In the sequel of this section, we always assume that the hypotheses of Theorem 3.1 are satisfied 
and we use the decomposition of Mn{0) as defined in (A. 7). 

Step 1 : there exists some positive constant Ci that only depends on c* such that 

<Ci fnJ||VAf„(0°)f +nJsup Wv"^ Mn{9) - M {e^)f (A.17) 
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where V and denote the gradient and the Hessian operators, respectively, and where we have 
set 

= 1^ G e« such that 11^ - e^w < \\e° - ^°||} 

and, for any J x J matrix B, the operator norm ||i?||op is defined by 



lollop = sup 



\B9\ 
W 



6»eMJ\{0} 

Step 2 : there exists some positive constant C2 that only depends on A, s and cr^ such that 



nJK\\VMn{ 



aOMi2 



(A.18) 



Step 3 : there exists some positive constant C3 that only depends on A, s, k, c* and cr^ such 
that 

nJE (sup \\VHU0) - ^'M{6^)\\1 - 6^) < C3 (l + ^) . (A.19) 

The result announced in Theorem 3.1 follows from Inequalities (A. 17), (A.18) and (A.19). 



A. 1.1 Proof of Step 1 

The gradients of M{9), Q{0) and L[9) follows from easy computations. We have, for any 
KK J, 



de 



\k\<ko 
k\<ko 



'-k,e' 



(A.20) 
(A.21) 



and 



47r 



\k\<ko 



E 

(A.22) 

Similarly, we can compute the Hessians of these functions as follows, for 1 < < J, ii i ^ i', 



87r2 

J2 



\k\<ko 



Q{0) 



iVj2 



7,(0) -{0) 2ik-K{e^,-ei) 

^kl^kl'^ 



MM j,ikiT{e,,-et) 

'^kl'^kl'^ 



\k\<kQ 



(A.23) 
(A.24) 
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and 



a2 



and, ii£ = £' 



m 



jViv 



\k\<ko 



JO) (0) 2ik7T{e,,-ee) , ^(0)^(0) 2ik7r{ee-0,,) 
'-k,e''^kl^ ~^ '-kl^kl'^ 



(A.25) 



deedde 
_SP_ 



|fc|<fco 



J 



iVj2 



fc|<feo 



J 



^io)^2ikTTej 



(A.26) 



(A.27) 



and 



a2 



8^^ 



.PVN 



fc|<fco 



-(O)„2ifc7r0f 



W^2ikiTej 



_|__j,(0)g2jfc7r9<. 



J 



-(0) 2jfc7r6»j 



(A.28) 



Using the fact that 9^ S 0^ is a minimizer of M„, so VM„(^'^) = 0, a Taylor expansion of 
■> VMn{0) with an integral form of the remainder term leads to 



{) = vMn{e'')+ / v'Mn{e{t)){9'' -e'')dt 



(A.29) 



where, for any t G [0, 1] , we have set 

m 

Thus, we have 



3O 1 +(aG /lO 



V^M{9^W - n = -yMn{9^) - / [V^Mn{9{t)) - V^M{9^)) {9° - 9^)dt . (A.30) 

Jo 

It follows from similar computations as we did for M that 

\k\<kQ 

where Ij is the J x J identity matrix and Ij denotes the J x J matrix with all entries equal to 



one. Therefore, using the fact that 



J J 



0, we obtain 



\\V'^M{9°){9'^ -9°)f 



647r^ 



E k'\ck\' 1 11^"° 

\k\<ko 
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and it shows that there exists a constant C > that only depends on c* such that 



(A.31) 



Then, Inequahty (A. 17) foUows from (A. 30) and (A.31). 
A. 1.2 Proof of Step 2 

By using Lemma A.l, for any 1 < A; < /cq and 1 < £ < J, we can expand 

with \ak/\ < AqN~^'^'^/'^ . Because, for any j, 9j — 9* = 9j does not depend on j, we have 

J 



Thus, by Equation (A. 20) and using Cauchy-Schwarz's Inequality, we obtain 



_d_ 

d9f 



M{9^) 



< 



< 



J4 

327r2 
647r2 



|fc|<fco 



• < E4' 



(0) 2ifc7r6lj 



\k\<ko 

{2ko + l)AlN-^'+^{A'^ + AgA;|]A^-2^+i) . 



Thus, there exists a positive constant C that only depends on A and s such that 

nJ\\VM{9^)f < Cko (1 + kln-^'+^) n-^'+^ 
We now focus on VQ and, by (A. 21), we can obtain 



(A.32) 



e=i 



< 



< 



i=l \k\<ko 

167r2(J-l)2 ^ 



(0) 2ifc7r0O 



327rVfc| 



1=1 \k\<ko 



Hence, we get 



n 



(A.33) 
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E 



Finally, we deal with VL. Equation (A. 22) and Lemma A.l imply 

J 



167r^ 



< 



< 



< 



|fc|<fco 



fc|<fco 



2 



-(o)^2ik-Kej 



-{0) 2ikn0e 

'-k,e^ 



(0)g2ifc7r6»j 



+ J 



.(0) 



J 



:{0) 



j2iV 



+ J 
327r2o-2 



.(0) 

"kl 



l2Ar-2^* + l 



|fc|<fco \ i = l / |fc|<A:o 

This last inequality leads to 

nJE\\VL{e°)f < 647r2a2(^2 ^ 2^^) (l + ytj^n-^^+i) . (A.34) 
By combining Inequalities (A. 32), (A. 33) and (A.34), we then obtain Inequality (A. 18). 
A. 1.3 Proof of Step 3 

We introduce the Frobenius norm defined, for any J x J matrix B = [-B£/']i<£/'<j, as 



\B\ 



We also recall the classical inequalities 



l-B|L„ < II-BIIf and ll-BllnTi < max > \Bc 
I nop - II 11^ II wop - ^<^,<^z^i 

- - 1=1 



In order to prove Inequality (A. 19), we use the following decomposition 

\\V^Mn{e)-V^M{e'^)\\% < A{\\V^M{9)-V^M{e)fp 



op 



+\\v^Q{e)fp + \\v^m\\l) 



(A.35) 



(A.36) 



We now deal with each term in the above inequality. Hereafter, i and £' always denote two 
integers in {1, . . . , J}. 
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First, let us consider I ^ by (A. 23) and Lemma A.l, we get 



92 _ 52 

M{0) - ^^Mi9) 



< 



< 



647r4 
647r^ 
2567r'^ 



fc|<fco 



fc|<fco 



^(O)-(O) _ , ,2 2ikn(9;,-9*) 



fc|<fco 



In the case of £ = (A. 26) and Lemma A.l lead to 



< 



< 



647r^ 
2567r^ 



|fc|<fco 



■2s+l 



Therefore, the above inequalities and (A.l) imply that there exists some positive constant Cm 
that only depends on A, s, k and cr^ such that 



nJE ( sup \\V'^M{6) - V'^M{e)\\p \\ff 



2 11^0 o0\\2 



Secondly, using the fact that 2(1 — cos(t)) < for any t G M, if £ 7^ then we have 



(A.37) 



M{0) - -,^M{e') 



ddffdee 



< 



8^ 
167r^ 



\k\<ko 



j2ik-K{ei-e°+e°-et,) 



1 



\k\<ko 



and if ^ = ^' then we obtain 



92 



■M{e) - M{ff 



< 



87r2 

J2 

16^^ 



|fc|<fco 

E k\k\' 

\k\<ko 



J 



2ikn(ee~e'^-ej+e°) 



E I 
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Therefore, by (A. 35) and under the condition s > 2, we get 



lop 



< 



< 



327r4^2 



max 



J2 i<e<j 



E 



"^3 + 



q0|2 



.4 /l2 



7 



■ lie 



n0||2 



Thus, by definition of Z//^ and by (A. 3), we know that there exists some C'j^ > that only 
depends on A, s, cr^ and k such that 



< c 



^,3/2 j2 
1/2 • 



n 



Next, we deal with the term relative to ||V^(5(0)||p. Let us begin by noting that 
4Jk?. Thus, we have 



E ( sup ||V^Q(e)|||||^° - eY ] < 4Jk'E ( sup \\V'Q{e)Wj, 



If we take i ^ i' then, using (A. 24), we get 



(A.38) 



< 



(A.39) 



-Q{0) 



dOedOi 

and \i I = then, by (A. 27), we have 



< 



647r^ 



Ak\<kQ 



2| J0)|| (0) I 



52_ 



Q{0) 



< 



64^ ^ / ^ A.2u(0)||J0)| 



2u(0)| 



Ak\<ko 



J 



(0) 2iTTkej 



i=ijV^ \|fc|<fco 

Hence, Cauchy-Schwarz Inequality leads to the following upper bound 

A J J 



E(sup||V=QW|||) < 



EE E E * 



2u(0)|, (0) 



< 



< 



wE E e( j:.=i4°]i=|e( 



|fc|<fco 
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Combining this bound with (A. 39) gives us some Cq > that only depends on k and cj^ such 
that 

nJE f sup \\V^Q{e)\\l\\e'^ - < . (A.40) 



n 



Finally, we focus on the term concerning ||V^L(^)|||,. By Cauchy-Schwarz Inequality and 
(A. 2), we have 



E{snp\\V^L{9)\\Ue''-eYj < WE(^sup||V2L(^)|||)^/E(||eo_0O||4 



< ^E(sup||V2L(0)||4^y^ 
Using Lemma A.l and (A. 25), ii i ^ i' we obtain 
52 



n 



de 1,001, 



■m 



647r' 



fe|<fco 



JO)^(O) 



+ 



JO) ^(0) 



-s+l/2 



fc|<fco 



,(0) 
^kl 



+ 



,(0) 
^kl' 



and, by (A.28), if ^ = f , we get 



(A.41) 



dOpdO, 



■m 



J 



J 



:(0)| 



< 



647r4 



,{0)| 



M<ka 
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Hence, we upper bound the expectation in (A. 41), 



E( sup ||V^L(^)||^ 



J 



e=l \|A:|<fco 



+7E E k'[\-k\+A,N 



El E^M(N+^oiv-^"^/^)|^i41i+ E 



2i 2N 



,(0) 



+ 



k\<ka 



,(0) 



(0), 



E I'fcj 



2- 2 



< 



< 



< 



J 

i=l \|fc|<feo 

4 X 40967r8 
j2iV2 

4 X 409671^ 



-s+l/2 



,(0) 



\k\<ko 



El 



(0), 



JV2 

168 X 40967rW2A;^ 



k\<ko 



fc|<fco i=i 
2 



(0)|2 



Ar2 



^1 Y k'[\c,\+A,N~^+^l^ 

\k\<ko 



Therefore, there exists some constant > that only depends on A, s and cj^ such that 



Efsup \\V^L{e)\\%\<C'J^ 



2 / N 2 

1+ ^0 



2s-l 



(A.42) 



Using (A. 41) and (A.42), we know that there exists some constant > that only depends on 
2 



c^=, K, j4, s and a such that 



nJE( sup ||v2L(0)|||||e"O-eOf\ < c^M^ 



1 + 



1,5 







n 



2s-l 



(A.43) 



Finally, we use (A.37), (A.38), (A. 40) and (A.43) with (A.36) to get (A. 19). 
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A.2 Proof of Theorem 3.2 

Let us assume that / G Ws{A,c^,). We upper bound the distance between / and fn,j, 



d\[f],[fn,j]) 



inf 

ee[~i/2,i/2] Jq 



f{t-e)-fnAt) 



dt 



< 2(i2([/],[/(-i)]) + 2 / /(-i)(t)-/„,^(t) 



dt . 



Taking the expectation according to the distribution of Y^^^ on both side and using (3.5) leads 
to 



< 2dH[f], [/(™^)]) + 2C{r,) min | /' - f^^\t) 

me{l,...,m.i} Uq 



2 , 2(m + 1)0-2 

dt+ ^ ' 



< 2d2([/],[/(-^)])+2C(r?) min ^ 

me{l,...,mi} f-^ 

m<\k\<mi 



J 



1 -(1) 2i7rfce0 



+ 



NJ 



2{m + 1)^2 



J-A.44) 



Let = {91 + • • • + 6j)/J, we recall that 9^ = 9* — 9j. We begin by upper bounding the 
first term. Thanks to Jensen's inequality, we obtain 



dt 



fe|>mi |fe|<mi 



|fc|>»ni \k\<mi j=l 



J 



Since / S Ws{A,c^,), we easily upper bound the bias part 



-2iTTkej _ -{i)2i-Kke° 



E |cfc|'<^|mi|- 

|fe|>mi 



■2s 



(A.45) 



(A.46) 
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To deal with the other part, we spht it into two sums, 



J 



< 



< 



< 



J 

2|cfc| 



2 ^ , 



2 2 



2i-Kk[e°-6') -(1) 2i7rfceO 



g2i7rfc(eO-eO) 



2 2 v^^ 

+ 7^ 



-(1) -2i-Kke* 



J 



(A.47) 



where the last inequality follows from Lemma A.l and from 2(1 — cost) < t^, t € M. Combining 
(A.45), (A.46) and (A.47), we get, for any g gG'^, 



d\[f],[f"'M < A\m^\~^' + 2Ali2m, + l)N- 



|fc| <mi 



■2s+l 



J 



4e 



(A.48) 



We now focus on the second term in (A. 44). Let akj = c^e — cj}\ . Using Jensen's 

inequality and Lemma A.l, for any m G {1, . . . , ?ni}, we have 



E 

m<\k\<mi 



-L -(1) 2i7rA;6'0 



Let us consider m* such that 



< 



E 7Eh^^ 

m<|fc|<r?ii i=l 



-2j7rfc6» 



J 



|fe|>m m<\k\<mi j=l 

< 2^m-2^ + 4Agmi Af-2s+i ^ 

l/(2s+l) 



where c is the constant such that J < cn°'. Note that such a choice is allowed because it is such 
that G {1, . . . , mi} since n > 21, s > 3/2, a € (0, 1/6] and c G (0, 1). In particular, such a 
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choice leads to the following upper bound 



mm 

m£{l,...,mi} 



E 

^ m<\k\<mi 



1 -(1) 2i7Tk0" 



+ 



2(m + l)cr^ 

Wj 



< 4^2 A,-2.+i ^ ^ 2 min + ^ 

NJ me{l,...,mi} 1^ NJ 

< 4AlmiN-^'+' + (^1 + + 4^2)^2./(2.+i)^ (^^p2./(2.+i) _ 

Putting (A.48) and (A. 49) in (A. 44) leads to, for any g 

ng{fn,jj) < A\mi\-^' + AAl{{l + C{v))mi + l)N-^'+^ 



■2s/(2s+l) 



(A.49) 



, |fc|<mi 



+C(7?) (l + (A + 4<t2)c2^/(2''+i)) (nJ)"2'^/(2^+i) , 
that completes the proof using the fact that mi = [A^/2j — 1. 

A. 3 Proof of Theorem 4.1 

The arguments that we use to derive this result are based on Assouad's cube Lemma (see e.g. 
[Tsy09]). This Lemma is classically used in nonparametric statistics to derive lower bounds on 
a risk. We will show that one can construct a set of functions J^q C Ws(^, c*) such that there 
exists a constant C > (only depending on A, s, c* and o"^) such that, for any large enough n 
and J, 

TZn,j{W,iA,c,),g^) > inf sup sup Up^CU^jJ) > C{njy^^ , (A.50) 

where fn,j denote some estimator of /. For the sake of legibility, we assume in the sequel that 
c* = 1. Let 



^0= I fw ■■ [0, 1] M; /^(t) = ^Hn,j V Wk(f)kit), Wk G {-1, 1}, w^k = Wk 



where Kn,j = {fc G Z, < |A;| < Dn^j}, fin, J is a positive real and Dn^j is a positive integer 
that will be specified below. Let us introduce the notation Q = { — 1, l}^"--' and note that any 
function E is parametrized by a unique element w £ Q. Under the condition 



Hn,J = cDj' \ with c< ^ , 



(A.51) 



it can easily be checked that J^q C VF5(A, c^,). In what follows, Dn,j is chosen as the largest integer 
smaller that (nJ) 2s+i . Hereafter, E^^ will denote the expectation with respect to the distribution 
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Ffo of the random vector (Yf j)i<£<n,i<j<J G ^""^ iii model (1.3) under the hypothesis that 
f = fw and the assumption that the shifts are i.i.d. random variables with density g £ G'^. Note 
that for any g £ Q'^ 



sup 7^g(/„,J,/) = sup E 



inf 

.ee[o,i] \Jo 



\fn,jit-0)-fit)\^dt 



> 



> 



wen 



inf 

.ee[o,i] \Jo 



\fnAi-G)- Ut^dt 



1171 ^ 



inf > Cfc 



„ 2ikn9 /—, 



where Cfc = fn,j{t)<Pk{t)dt is the fc-th Fourier coefficient of Now, we consider, for k G i^n.j 
and 6* G [0,1], 



Wk,e e argmin \cke 



-2ikne 



y/lJ'n,jv\ . 



We have the inequality 



\y/¥^Wk,e - y/fJ-n,jWk\ < ^^'"'^ - y/JI^Wkfi\ + jcfcC ''"""^ - ^/^J■n,JWk 



2ikne 



that implies 



inf |u'fc,0-u;fc| 



For G and A; G Kn^j, we define t;;'''^-* G such that, for any £ ^ k, w^^^ = W£ and w'j^' = —Wk- 
Then, it follows that 



sup ng{fn,J , f) > 



4101 



k&K^ j w£fl\wk = l 



(A.52) 



where we have set 

Rk = K 



inf 



+ E^ 



inf (\wk,e + Wk\ 
ee[o,i] 



Let ^* = {91, . . . , we introduce the notation E^* to denote expectation with respect to the 
distribution of the random vector (ygj)i<£<n,i<j<j G M""^ in model (1.1) conditionally to 
61, ... ,6 J. Hence, using these notations, we have 



.1 iiJ 

2' 2' 



Rk{e*)g{ei)...g{e*j)dei...de*j 



(A.53) 



where 



Rk{e*)=K 



inf \wkfi - Wk\ 

.fG[0,l] 



[k) 



inf \\wk,d + Wk\ 

.fG[0,l] 
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Now, note that for any < 5 < 1, 



( 2\ f 2\ ^ (fc) 

\nf [\wk,e-Wk\ + u\i [\wk,e + Wk\ Jfn, {Y) 



> 4E3mm > AS. 



{Y)>5 



(A.54) 



where Y G W^"^ is the random vector obtained from the concatenation of the observations from 
model (1.3) under the hypothesis f = fw and conditionally to 61,... ,9j. Because Wk = 1, we 
know that 



(Y) 



I E E - ^i) - /»(*^ - ^1)1' + (/-(*^ - ^1) - - 0*)) 



j=i i=i 



\ i=i£=i / 



Therefore, log ^ (Y) j is a random variable that is normally distributed with mean —-^nJfXnj 

and variance ■^nJfin,J- Now, since Dnj is the largest integer smaller than (nJ) 2s+i , it follows 
from equation (A. 51) that < nJfin.j < 2A, for any large enough n and J. Thus, there exists 
< 5 < 1 and a constant cs > (only depending on A, and (5) such that 



{Y)>b\> cs. 



Combining this inequality with (A. 52), (A. 53) and (A.54) lead to 



sup 7^g(/„,,J,/) > 



^ C5 > 6csfln,jDn,J- 



(A.55) 



— 2 —1 1 

Since = C-D,^ j and Dnj < (nJ) 2^+1 , it follows that 

which combined with (A.55) proves inequality (A. 50) and completes the proof of Theorem 4.1. 
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